LADA-Trans-NER: Adaptive Efficient Transformer for Chinese Named Entity Recognition Using Lexicon-Attention and Data-Augmentation
نویسندگان
چکیده
Recently, word enhancement has become very popular for Chinese Named Entity Recognition (NER), reducing segmentation errors and increasing the semantic boundary information of words. However, these methods tend to ignore relationship before after sentence integrating lexical information. Therefore, regularity length not been fully explored in various word-character fusion methods. In this work, we propose a Lexicon-Attention Data-Augmentation (LADA) method NER. We discuss challenges using existing incorporating NER show how our proposed could be leveraged overcome those challenges. LADA is based on Transformer Encoder that utilizes lexicon construct directed graph fuses through updating optimal edge graph. Specially, introduce advanced data augmentation obtain representation task. Experimental results done can considerably boost performance system achieve significantly better than previous state-of-the-art variant models literature four publicly available datasets, namely Resume, MSRA, Weibo, OntoNotes v4. also observe generalization application real-world setting from multi-source complex entities.
منابع مشابه
POLYGLOT-NER: Massive Multilingual Named Entity Recognition
The increasing diversity of languages used on the web introduces a new level of complexity to Information Retrieval (IR) systems. We can no longer assume that textual content is written in one language or even the same language family. In this paper, we demonstrate how to build massive multilingual annotators with minimal human expertise and intervention. We describe a system that builds Named ...
متن کاملChinese Named Entity Recognition Using Role Model
This paper presents a stochastic model to tackle the problem of Chinese named entity recognition. In this research, we unify component tokens of named entity and their contexts into a generalized role set, which is like part-of-speech (POS). The probabilities of role emission and transition are acquired after machine learning on a role-labeled data set, which is transformed from a hand-correcte...
متن کاملNamed Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملHybrid Models for Chinese Named Entity Recognition
This paper describes a hybrid model and the corresponding algorithm combining support vector machines (SVMs) with statistical methods to improve the performance of SVMs for the task of Chinese Named Entity Recognition (NER). In this algorithm, a threshold of the distance from the test sample to the hyperplane of SVMs in feature space is used to separate SVMs region and statistical method region...
متن کاملPOSBIOTM-NER: a trainable biomedical named-entity recognition system
SUMMARY POSBIOTM-NER is a trainable biomedical named-entity recognition system. POSBIOTM-NER can be automatically trained and adapted to new datasets without performance degradation, using CRF (conditional random field) machine learning techniques and automatic linguistic feature analysis. Currently, we have trained our system on three different datasets. GENIA-NER was trained based on GENIA Co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i11.26554